Media recognition and synchronisation to a motion signal

ABSTRACT

The present document describes a device and method for synchronizing a motion signal corresponding to a media content with a media signal for the media content, the motion signal being for controlling a motion feedback system. The method comprises: receiving a portion of the media signal; obtaining a fingerprint corresponding to the received portion of the media signal; from reference fingerprints associated with time positions of at least one reference media content, identifying a reference time position of the media content corresponding to the obtained fingerprint; obtaining the motion signal associated with the identified reference time position of the media content; and outputting the motion signal synchronized with the media signal using the identified reference time position of the media content for controlling the motion feedback system.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application is a continuation-in-part of U.S. non-provisional patent application Ser. No. 15/472,929 filed on Mar. 29, 2017, which is a continuation of U.S. non-provisional patent application Ser. No. 14/248,800 filed on Apr. 9, 2014, which is a continuation of U.S. non-provisional patent application Ser. No. 13/180,923 filed on Jul. 12, 2011 which claims priority of U.S. provisional application 61/366,301 filed Jul. 21, 2010 and entitled “MEDIA RECOGNITION AND SYNCHRONISATION TO A MOTION SIGNAL”, the content of which is incorporated herein by reference.

TECHNICAL FIELD

This description relates to the field of media recognition and synchronization. More particularly, this description relates to methods and devices for playing video and audio stream media in synchronization with a motion signal.

BACKGROUND

In controlling motion feedback systems for use in public, commercial or home environments, the challenge is in the synchronization of a motion signal for gaming, feature length movies, amusement park rides, virtual reality, etc. with audio or video signals during playback, such that the motion signal is delivered to motion feedback systems in time for the platform movement to match the reproduced audio-visual signals. The required tolerance in usually in the order of milliseconds.

In some applications, the motion signal is not transmitted synchronously with audio or video signals; the playback device must retrieve the motion signal separately, and synchronize them for playback with the audio or video signals. If the identity of the audio or video signals and position in time within the audio or video signals can be readily and accurately identified through metadata in the audio or video signals, the playback device may use this information to retrieve and synchronize the appropriate motion signal. If such metadata is not available, then another solution is required.

There is therefore a need for an improved synchronization method and device which addresses the above-noted challenges.

SUMMARY

The present disclosure therefore provides a motion signal synchronization method and device that overcomes or mitigates one or more disadvantages known to be associated with the prior art, or at least provides a useful alternative.

The proposed approach is based on a technique commonly referred to as media fingerprinting. This technique enables media recognition by way of deriving unique identifiers, which are typically referred to as media fingerprints, from particular media content. The correspondence between particular media fingerprints and particular media content is then used for identification purposes. For example, the recognition of audio and video content exploits acoustic and visual features to identify particular corresponding acoustic and video media respectively. Known media fingerprinting techniques are described in the following patent references: WO2002065782 (Philips), US20060041753 (Philips), and US20060075237 (Philips).

In light of the above media fingerprinting technique, the motion signal synchronization herein proposed employs media fingerprinting to recognize a given portion of an audio or video signal, to which is then associated a given motion signal for use during playback.

It was found that applying media fingerprinting techniques to identify a reference time position of the media content (e.g., the time position in a movie which is presently being read or played) and then obtain the motion signal which corresponds to this reference time position provides a satisfactory result since the media content and the motion signal can then be outputted synchronously. This combination provides a simpler, more precise, more robust, more stable, less expensive, more fault-tolerant solution that those described in the prior art for synchronizing a motion signal with a media signal. In contrast to prior art method, this method is completely independent of the method used for encoding, compressing or packaging the audio signal. Because this method relies on the acoustic characteristics of the audio signal, it can be made resilient to changes in the pitch, speed or amplitude thereof.

Motion feedback systems are meant to include motion platforms, motion actuators, motion simulator, vibrotactile transducers, and inertial shakers.

Motion platforms are meant to include any platform or seating arrangement to which motion is induced by actuators and on which a person is installed. Motion platform applications include, but are not limited to, gaming, feature length movies, amusement park rides, virtual reality, etc. An example of a motion platform include seat or chair for one or more persons on which are mounted one or more actuators which interface with the ground. Another example would be a platform for receiving a seat, chair or other device accommodating a user, and on which are mounted one or more actuators which interface with the ground.

Motion codes, a motion code signal or a motion code stream contain motion effects, or like viewer-effect signals contain other viewer-effects such as subtitles, alternative languages, audio track for deaf people, or people with hearing impairment, based on the content meant to be reproduced to the viewer. In most cases, the motion signals are prepared by motion designers. In other cases, the motion signals or portion thereof are generated automatically.

A media signal is to be understood as comprising an audio signal and/or a video signal.

An audio signal or an audio stream is to be understood as encompassing sound information meant to be reproduced to the during the presentation, such as the sound track of games, feature length movies, amusement park rides, virtual reality environments, etc. An example of an audio signal or an audio signal is encoded Pulse Code Modulation (PCM) samples.

A video signal or a video stream is to be understood as encompassing visual information meant to be reproduced to the user, such as the main picture of games, feature length movies, amusement park rides, virtual reality environments, etc. An example of a video signal or a video signal is an MPEG-4 Part 10 video elementary stream.

Media content is meant to include games, feature length movies, amusement park rides, virtual reality environments, advertisements, etc.

Feature length movies are those intended to be shown in movie theaters or home theaters. They last generally more than 30 minutes and have separate video and audio tracks to which a motion signal, a stream of motion codes or a motion code track can be associated.

According to an embodiment, there is provided a method for synchronizing a motion signal corresponding to a media content with a media signal for the media content, the motion signal for controlling a motion feedback system, the method comprising: receiving a portion of the media signal; obtaining a fingerprint corresponding to the received portion of the media signal; from reference fingerprints associated with time positions of at least one reference media content, identifying a reference time position of the media content corresponding to the obtained fingerprint; obtaining the motion signal associated with the identified reference time position of the media content; and outputting the motion signal synchronized with the media signal using the identified reference time position of the media content for controlling the motion feedback system.

According to another embodiment, there is provided a method for determining if a motion signal corresponding to a media content is synchronized to a media signal for the media content, the motion signal for controlling a motion feedback system, the method comprising: receiving a portion of the media signal; obtaining a fingerprint of the received portion of the media signal, namely an obtained fingerprint; retrieving a fingerprint of the portion of the media signal at a reference time position of the media content, namely a reference fingerprint; comparing the obtained fingerprint and the reference fingerprint; based on the comparing, determining if the motion signal is synchronized to the media signal; and according to the determining, outputting the motion signal synchronized with the media signal for controlling the motion feedback system.

According to another embodiment, there is provided a device for synchronizing a motion signal corresponding to a media content with a media signal for the media content, the device having access to a storage medium which comprises reference fingerprints associated with time positions of at least one reference media content, the motion signal for controlling a motion feedback system, the device comprising: an input unit for receiving a portion of the media signal; a processing device in communication with the input unit and the storage medium; and a memory accessible by the processing device, the memory storing instructions for implementing the processing device to: obtain a fingerprint corresponding to the received portion of the media signal; from reference fingerprints associated with time positions of at least one reference media content, identify a reference time position of the media content corresponding to the obtained fingerprint; obtain the motion signal associated with the identified reference time position of the media content; and output the motion signal synchronized with the media signal using the identified reference time position of the media content for controlling the motion feedback system.

According to another embodiment, there is provided a non-transitory computer readable memory having recorded thereon statements and instructions for execution by a computer to carry out a method for synchronizing a motion signal corresponding to a media content with a media signal for the media content, the motion signal for controlling a motion feedback system, the method comprising: receiving a portion of the media signal; obtaining a fingerprint corresponding to the received portion of the media signal; from reference fingerprints associated with time positions of at least one reference media content, identifying a reference time position of the media content corresponding to the obtained fingerprint; obtaining the motion signal associated with the identified reference time position of the media content; and outputting the motion signal synchronized with the media signal using the identified reference time position of the media content for controlling the motion feedback system.

BRIEF DESCRIPTION OF THE DRAWINGS

Further features and advantages of the present disclosure will become apparent from the following detailed description, taken in combination with the appended drawings, in which:

FIG. 1 is a flowchart of a method for synchronizing a motion signal to an audio signal, in accordance with an embodiment;

FIG. 2 is a flowchart which illustrates an embodiment of a method for determining if a motion signal corresponding to a media content is synchronized to a media signal for the media content; and

FIG. 3 is a schematic illustration of a device for synchronizing a motion signal to an audio signal, in accordance with an embodiment.

It will be noted that throughout the appended drawings, like features are identified by like reference numerals.

DETAILED DESCRIPTION

Referring now to the drawings, and more particularly to FIG. 1, there is illustrated an embodiment of a method 20 for synchronizing a viewer-effect signal such as a motion signal corresponding to a media content, for example from a feature length movie or amusement park rides, with a media signal for the media content. The motion signal is for controlling a motion feedback system. The viewer-effect signal may be continuous and may also be subtitles or other visual signals, additional audio tracks such as a track of alternative language, an audio track for the visual impaired by which an additional narrative is added to provide context (e.g., “car is arriving”), track for deaf people, or people with hearing impairment, such as subtitles, sign language, or written narrative describing a sound (e.g., “explosion sound”). According to an embodiment, the media signal is an audio signal and/or a video signal. For simplicity, the methods described herein will refer to motion signal. However, all methods also apply to other types of viewer-effect signals such as those listed above, not only motion signal.

In step 22, a portion of the media signal is received. For example, the media signal is received from a microphone, for instance in ambient. In step 24, a fingerprint corresponding to the received portion of the media signal is obtained.

According to an embodiment, the step of obtaining a fingerprint corresponding to the received portion of the media signal (step 24) comprises computing the fingerprint of the received portion of the media signal. In the computing the fingerprint step, the fingerprint of the portion of media signal may be generated as a code that uniquely corresponds to a waveform of the media signal. Upon generating such a unique code (i.e., a coded identifier), the corresponding waveform of the media signal from which the fingerprint was generated is thereafter identifiable by referencing to its fingerprint as described herein.

In step 26, from reference fingerprints associated with time positions of at least one reference media content, a reference time position of the media content corresponding to the obtained fingerprint is identified.

More specifically, in one embodiment of step 26, a database (aka a storage medium) of coded identifiers for the fingerprints for the media signal at the multiple playback time positions is consulted. The fingerprint of the portion of the media signal is matched to a reference fingerprint associated to the reference time position within the media content.

According to an embodiment, the identifying a reference time position of the media content (step 26) takes place on a remote networked server (not shown).

According to an embodiment, method 20 further comprises receiving the identity of one reference media content of the at least one reference media content to which the media signal corresponds prior to the identifying a reference time position of the media content (step 26). The step of identifying the reference time position of the media content is performed using reference fingerprints associated with time positions of the identified reference media content.

Method 20 may further comprises receiving at least one reference fingerprint associated with the outputted motion signal and stopping the output of the motion signal when the obtained fingerprint does not match any of the at least one reference fingerprint.

In another embodiment of step 24, the obtaining the fingerprint comprises retrieving the coded identifier for the fingerprint of the portion of the media signal at the reference time position directly from the motion signal.

In step 28, the motion signal associated with the identified reference time position of the media content is obtained. According to an embodiment, the motion signal associated with the identified reference time position of the media content is obtained from a remote networked server (not shown).

According to an embodiment of step 28, the identifying the current playback time position comprises continuously performing the obtaining a fingerprint, the identifying a reference time position, and the obtaining the motion signal steps to keep outputting the motion signal synchronized with the media signal, thereby ensuring continuous synchronization of the motion signal.

According to another embodiment of step 28, the identifying the current playback time position comprises performing the obtaining a fingerprint, the identifying a reference time position, and the obtaining the motion signal at a given frequency to resynchronize the outputting the motion signal with the media signal.

The reference time position refers to the location of the media signal within the at least one media content which is identified as corresponding to media content which is currently being played (corresponding to the obtained fingerprint) on either one of the speakers (for audio), the screen or display (for video) and the motion feedback system (for motion).

Step 30 involves outputting the motion signal, using the identified reference time position of the media content, synchronized with the media signal. The motion signal is eventually sent to a motion feedback system having actuators for moving the motion feedback system according to the motion signal. The viewer-effect signal is sent to the appropriate apparatus for playback. This may include a portable device (e.g., smart phone, tablet, dedicated audio device, seat monitor) that displays the additional written data (e.g., for the hearing impaired) or outputs the additional audio data (e.g., for the visually impaired).

According to an embodiment, method 20 further comprises receiving a motion signal corresponding to the media content prior to the identifying a reference time position of the media content (step 26), wherein the reference fingerprints are retrieved directly from the received motion signal.

According to an embodiment, the synchronized motion signal is stored in a storage device for eventually being sent to the motion feedback system.

The method 20 is performed, in one example, in real-time, as the media signal is being played.

Still according to another embodiment, method 20 further comprises receiving at least one reference fingerprint associated with the outputted motion signal and stopping the output of the motion signal when the obtained fingerprint does not match any of the at least one reference fingerprint.

FIG. 2 illustrates an embodiment of a method 100 for determining if a motion signal corresponding to a media content is synchronized to a media signal for the media content.

In step 102, a portion of the media signal is received for instance with a microphone. In step 104, a fingerprint of the media signal for the current playback time position is obtained (i.e., an obtained fingerprint).

In step 106, a fingerprint of the portion of the media signal at a reference time position of the media content is retrieved based on the current position within the viewer-effect signal, namely a reference fingerprint.

In step 108, the reference fingerprint of step 106 is compared with the obtained fingerprint of step 104. In step 110, based on the comparing, it is determined if the motion signal is synchronized to the media signal If the two fingerprints differ, the motion signal and media signal are not synchronized. This can occur, for instance, if the position within the motion signal changes abruptly, such as during a fast-forward or chapter-skip operation. The system may automatically choose as a result to apply method 20 to recover synchronization.

In another embodiment of step 104, the obtaining the fingerprint comprises retrieving the coded identifier for the fingerprint of the portion of the media signal at the reference time position directly from the motion signal.

In step 112, the motion signal synchronized with the media signal using the identified reference time position of the media content for controlling the motion feedback system is outputted according to the result of the determining if the motion signal is synchronized with the media signal (step 110).

Now referring to FIG. 3, there is shown an embodiment of a device 50 for synchronizing a motion signal for a feature length movie with a media signal.

The device 50 has an input unit 52; a processing unit 54; a memory 56; one or more databases (here databases 58 and 60); and an output unit 62 in communication with a motion feedback system 64 having actuators 1 to N, 66 to 68.

The input unit 52 is for receiving a portion of a media signal.

The databases 58 and 60 are used respectively for storing motion signals and reference media signal fingerprints in association with particular known media content (e.g., a known feature length movie). Such reference fingerprints are identifiers already derived for the known media content, and which are known to be uniquely related to the known media content.

The processing unit 54 is in communication with the input unit 52 and the databases 58 and 60, as well as with the memory 56. The memory 56 is thus accessible by the processing unit 54.

In one embodiment, the memory 56 is used to store instructions for implementing the processing unit 54 to perform a series of steps on the media signal which involve access and retrieval of data from the databases 58 and 60, as well as processing steps allowing the synchronisation of a motion signal with the media signal. The processing unit 54 is also in communication with the output unit 62 such that the motion signal, once retrieved and synchronized, are sent out either for storage on a database (not shown), or directly to a motion feedback system 64 during playback of the media signal.

Once the portion of the media signal is received at the processing unit 54, via its transmission from the input unit 52, the processing unit 54 obtains a fingerprint corresponding to the received portion of the media signal.

The processing unit 54 then proceeds to identify, from reference fingerprints associated with time positions of at least one reference media content, a reference time position of the media content corresponding to the obtained fingerprint. For example, to perform such comparison, the processing unit 54 accesses the database 60 of reference fingerprints; parses them via optional retrieval of data to an optional internal memory (not shown) until a match is detected between a reference media signal fingerprint and the fingerprint of the media signal. When no match is detected, an error signal is sent out to a user interface (not shown).

Then the processing device obtains the motion signal associated with the identified reference time position of the media content.

Then, based on the identified reference time position of the media content, the processing device is able to synchronize and output the motion signal with the media signal. For example, to do so, the processing unit 54 retrieves and sends the motion signal from the database 58 starting at the reference time position that corresponds to the reference time of the media signal received at input unit 52. The motion signal which is sent at output unit 62 is therefore synchronized with the playback time position of the media signal.

According to another embodiment, there is provided a non-transitory computer readable memory (not shown) having recorded thereon statements and instructions for execution by a computer to carry out a method for synchronizing a motion signal corresponding to a media content with a media signal for the media content. The motion signal is for controlling a motion feedback system. The method comprises method 20 shown in FIG. 1.

Finally, according to another embodiment, there is provided a non-transitory computer readable memory (not shown) having recorded thereon statements and instructions for execution by a computer to carry out a method for determining if a motion signal corresponding to a media content is synchronized to a media signal for the media content. The motion signal is for controlling a motion feedback system. The method comprises method 100 shown in FIG. 2.

While embodiments have been described above and illustrated in the accompanying drawings, it will be evident to those skilled in the art that modifications may be made therein without departing from the scope of this disclosure. Such modifications are considered as possible variants comprised in the scope of the disclosure. 

1. A method for synchronizing a viewer-effect signal corresponding to a media content with a media signal for the media content, the viewer-effect signal associated to the media content, the viewer-effect signal for producing an effect for a viewer, the method comprising: receiving a portion of the media signal; obtaining a fingerprint corresponding to the received portion of the media signal; from reference fingerprints associated with time positions of at least one reference media content, identifying a reference time position of the media content corresponding to the obtained fingerprint; obtaining the viewer-effect signal associated with the identified reference time position of the media content; and outputting the viewer-effect signal synchronized with the media signal using the identified reference time position of the media content for producing the effect in synchronization with the media content.
 2. The method of claim 1, further comprising receiving the identity of one reference media content of the at least one reference media content to which the media signal corresponds prior to the identifying a reference time position of the media content, and wherein the step of identifying a reference time position of the media content is performed using reference fingerprints associated with time positions of the identified reference media content.
 3. The method of claim 2, further comprising receiving at least one reference fingerprint associated with the outputted viewer-effect signal and stopping the output of the viewer-effect signal when the obtained fingerprint does not match any of the at least one reference fingerprint.
 4. The method of claim 1, further comprising receiving at least one reference fingerprint associated with the outputted viewer-effect signal and stopping the outputting the viewer-effect signal when the obtained fingerprint does not match any of the at least one reference fingerprint.
 5. The method of claim 1, wherein the media signal comprises at least one of an audio signal and a video signal.
 6. The method of claim 5, wherein the media content comprises a feature length movie and the audio signal and the video signal comprise the audio signal and the video signal for a feature length movie.
 7. The method of claim 1, wherein the obtaining a fingerprint corresponding to the received portion of the media signal comprises computing the fingerprint of the received portion of the media signal.
 8. The method of claim 7, wherein the computing the fingerprint comprises applying a media fingerprinting technique to generate the fingerprint as a coded identifier uniquely representative of the media content at the reference time position.
 9. The method of claim 8, wherein the identifying a reference time position of the media content comprises consulting a database comprising coded identifiers for the reference fingerprints to which are associated reference time positions and obtaining the reference time position that corresponds to the coded identifier uniquely representative of the media content at the reference time position.
 10. The method of claim 1, wherein the identifying the reference time position comprises continuously performing the obtaining a fingerprint, the identifying a reference time position, and the obtaining the viewer-effect signal steps to keep outputting the stream of viewer-effect signal synchronized with the media signal, thereby ensuring continuous synchronization of the viewer-effect signal.
 11. The method of claim 1, wherein the identifying the reference time position comprises performing the obtaining a fingerprint, the identifying a reference time position, and the obtaining the viewer-effect signal at a given frequency to resynchronize the outputting the viewer-effect signal with the media signal.
 12. The method of claim 1, further comprising receiving a viewer-effect signal corresponding to the media content prior to the identifying a reference time position of the media content, wherein the reference fingerprints are retrieved directly from the received viewer-effect signal.
 13. The method of claim 1, wherein the identifying a reference time position of the media content takes place on a remote networked server.
 14. The method of claim 1, wherein the viewer-effect signal associated with the identified reference time position of the media content is obtained from a remote networked server.
 15. The method of claim 1, wherein the method is performed with the viewer-effect signal being a motion signal, the motion signal for controlling a motion feedback system in synchronization with the media content.
 16. The method of claim 1, wherein the method is performed with the viewer-effect signal being a visual signal, and further comprising displaying the visual signal in synchronization with the media content.
 17. The method of claim 18, wherein displaying the visual signal in synchronization with the media content includes displaying subtitles.
 18. The method of claim 1, wherein the method is performed with the viewer-effect signal being an additional audio signal, and further comprising emitting the additional audio signal for the viewer in synchronization with the media content.
 19. The method of claim 18, wherein emitting the additional audio signal for the viewer in synchronization with the media content includes emitting an alternative language track.
 20. The method of claim 18, wherein emitting the additional audio signal for the viewer in synchronization with the media content includes emitting a track for visual impairment.
 21. The method of claim 1, wherein outputting the viewer-effect signal includes outputting a continuous viewer-effect signal.
 22. The method of claim 1, wherein receiving a portion of the media signal includes receiving the portion of the media signal with at least one microphone.
 23. A method for determining if a viewer-effect signal corresponding to a media content is synchronized to a media signal for the media content, the viewer-effect signal for producing an effect for a viewer, the method comprising: receiving a portion of the media signal; obtaining a fingerprint of the received portion of the media signal, namely an obtained fingerprint; retrieving a fingerprint of the media signal at a reference time position of the media content based on a current time position of the viewer-effect signal, namely a reference fingerprint; comparing the obtained fingerprint and the reference fingerprint; based on the comparing, determining if the viewer-effect signal is synchronized to the media signal; and according to the determining, outputting the viewer-effect signal synchronized with the media signal for producing the effect in synchronization with the media content.
 24. The method of claim 23, wherein the reference fingerprint is retrieved from a database comprising reference fingerprints associated with time positions for the media content.
 25. The method of claim 23, wherein the reference fingerprint is retrieved directly from the viewer-effect signal.
 26. The method of claim 23, wherein the method is performed with the viewer-effect signal being a motion signal, the motion signal for controlling a motion feedback system in synchronization with the media content. 